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DATA STRUCTURE FOR SEARCH 



BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a data search system, 
and in particular to a data structure for search and a method 
of constructing the 3ame. 
5 2. Description of the Related Art 

As a data search method, there have been known a table 
search lhal uses a lable for search and a radix search lhaL uses 
a tree structure for search. The table search such as a linear 
search or a binary search has a disadvantage that it takes 

lu increasingly much time tor search with increase in tne amount 
of data to be searched . 

The radix search treats data as strinqs of symbols selected 
from a limited number of kinds of symbols and uses a trae 
sLruclure in which aseLuf Lhe airings of symbols is classified 

15 according to a sequential one of symbols starting at the top 
of each string of symbols and each node is linked to several 
others by branches. Each node linked to zero or more children 
stores pointers to zero or more children as many as the number 
of Kinds included m the strmq of symbols . provided witn searcn 

20 data consisting of a plurality of symbols, the radix search is 
carried out by following nodes of the tree from a parent to its 
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selected child according to a sequential one symbol of the givon 
search data until a target is found. 

Taking a "longest prp.fi x sftarch method as an p.vample , the 
radix search method will be described in detail. In the longest 

3 Drexix search method, each item ot data to be searched is 

represented by a fixed- length (L) or Ices string of bite and 
has its related information. When provided with search data, 
all items of data matching the search data are found and one 
item having Lhe louyesL malch amuuu Lhem is selected ab a Laryel. 

10 In the case where each item of data to be searched and 

its related information as shown in Fig. 1, a binary tree in 
which the data is stored is shown in Fig. 2. For example, when 
search data "00 01111" is provided, the search ic carried out 
fcy following nodes in the order presented; Nl, N2, N3, N4, no, 

15 and N7 while reading the related information from a node having 
a node information flag set (indicated by blacked circle) , In 
this nasp. f at the nodp.fi N3 r N6, and N7, th« nndA Information 
flag is set. In other words, the item of data stored in each 
ot these nodes N3 , N6, and N7 matches the search data. Among 

20 them, the related information "D" stored in the node N7 having 

the longest match (the lowest level) is obtained as a tarqet. 
Assuming that the time required to follow one node is T , 
Lhe maximum btidrch Lime is 6T in the case of Fig. 2, 

Tn order to rpdnr.P rhp dppth nf a trftfl to shorten 
25 the maximum search time, N-ary tree structure (N > 2) is u3UQlly 

employed. In the case of N=4 (quad tree structure) , the search 



200 1 01/25 THU 13:50 FAX 03 3288 3222 Ktouragi Patent OSTROLENK 



B 005/059 



FQ5-512 < 

data is sequentially read in units of tvo bits. Accordingly, 
an item of data consisting of an odd number of bits is expended 
to data having a bit length of an integral multiple of 2 so as 
to meet the quad tree structure, For example, 5 bit data "00011" 
5 having the related information U D" is expanded to two items of 
6-bit data* "000110" and "000111*, each having the same related 
inf ormaLion ^D", as shown in Fly. 3. 

In the case where each item of expanded data to be searched 
and its related information as shown in Fig. 3, a quad tree in 

1(1 whir.h thA fiat a 1s stored 1s shown 1n P1g. 4. Each node stores 
data haying a format composed of a node information flag field 
kg, a pointer field having four pointers each corresponding 
branches: u 00", "01", u 10", and "11* to its child nodes, and 
cl related information field. The node inf urmaLiun flay FG is 

15 set to 1 when storing data and to 0 when storing no tfritfl. In 
the node N2, for example, the node information flag FG is set 
to 1. The respective tour pointers each corresponding to M UU" , 
"01", "10", and "11" indicate N3 , N4, N5 and NULL , which means 

thai the node nz has three corresponding branches to the 
20 child nodes N3, N4 , and N5 and has no branch for "11" . The node 

N2 s Lures Lhe related inf uima Lion A. 

For example, when search data "0001111" is provided, th^ 

search is carried out by following nodes in the order presented; 

nl N'l , N4, ana N7 while reading the reiatea information 
25 from nodec N2 , N4, and N7 each having the node information flag 

FG set to 1 (indicated by blacked circle). In this case, the 
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item of data stored in each of these nodes N2 , N4 , and N7 matches 
the search data "OQOllll"- Among them, the related information 
"D" stored 1n the. node- N7 having thft longest matnh (fhp. lowest 
level) is obtained as a target. Assuming that the time required 
5 to tollow one node is t. the maximum search time is 4T in the 
case of Pig, 4. Compared with the binary tree as shown in Fig, 
2, the quad tree reduces its maximum search time to two -thirds 
the mavimum search time of the binary tree. 

Huwevej. , if N fur N-aLy tree is sel Lo a large number so 

10 as to shorten the maximum search time, then the nnmbp.r of nnrip.s 
having the same related Information is increased by expanding 
the aata to meet the N-ary tree, in addition, since the size 
of one node and the number of pointers included therein are also 
increased in proportion to N, an increasing amount of memory 

15 is needed. 

For example, in the case of each node storing 1-bit node 
information flag FG, four 6-bit pointers, and 8-bit related 
information, each node needs 21 bits (-1 + 6x2-1-8) for the 
binary tree and I'd bits (=1 + 6x4 + 8) for the quad tree. 

20 The total of bite needed in the binary tree ae shown in Fig. 
2 is 21 (bits) x 15 (nodes) = 315 bits and the total of bits needed 
in the quad tree as shown in Pig. 4 is 33 (bits) x 14 (nodes) = 
462 bits. Assuming that the length of bits per address in 
a mpmnry stnH ng thft data to hft sftarchfiri 1 s 32 hi ts , 1 5 addresses 

25 are needed for the binary tree but 28 addre33B3 for the quad 
tree. 
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as described above , according to a conventional N-ary tree 
data structure, the number of nodes having the same related 
information is increased by expanding the data to meet f hfi N-ary 
tree. In addition, since the size of one node and the number 
: 5 of pointers included tnerein are also increased m proportion 
to N, an increasing amount of memory is needed. 



SUMMARY OF THE INVENTION 

An object of the present invention is to provide a data 
search system and a data structure allowing the necessary amount 

10 of memory to he reduced while maintaining or improving search 
performance, compared with the radix search method. 

According to the present invention, a data structure in 
which items of data are stored for search, includes: a tree 
structure in which the items of data are stored except for a 

15 portion of the items of data corresponding to a sub -tree 
structure, which is a selected portion of an assumed tree 
structure formed by all the items of data; and an equivalent 
table storing the portion of the 1 tenia of data. In table form. 
Th* tree* structure may include a plurality of nodes, each 

20 of which is composed of a node information flag, a plurality 
of pointers each corresponding to predetermined brancnes, ana 
related information, wherein each of the pointers indicator; one 

of its child node, the equivalent table, and null. The 
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equivalent table may include a plurality of entries, each of 
which is compused u£ a Lable nude iufoiaualiuu flay, a La.il entry 
flag, a data bit string, a search bit length, and related 
information . 

5 The data bit string may be arranged so that a length of 

the data bit string is equal to that of search data, wherein 
the search bit length indicates a length of an original data 
bit string to match with the search data. The entries in the 
equivdleuL table may be stored at consecutive locations in 
10 a memory. 

The sub -tree structure may be selected so as to satisfying 
the following conditions?: a) an amount of memory required to 
store the data structure is smaller than that required to store 
the assumed tree structure; and b] search performance of the 
15 data structure is not lower than that of the assumed tree 
sir uc Luj:ti . 

According to another aspect of the present invention, 
a method for constructing a data structure in which items of 
data are stored for search, includes the steps of: a) forming 
20 an assumed tree structure in which all the items of data arc 

stored; b) sequentially selecting a node from tne assumed tree 

structure to select a sub-tree structure designated by the 
selected nude; u) furminy an equivalent Lable btuxiiiy a puxtion 
of thp. itp.ms of rlafa r.nrrftspnnfl i ng to the seleotp.H snb-free 
25 structure in a table form; d) determining whether the selected 
sub-tree structure satisfies the following conditions: 1} an 
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amount of memory rAqnirftd to 55 tor a a data structure including 
the equivalent table in place of the selected sub tree structure 
is smaller than that required to store the assumed tree 
ctruoturc; and 2} eearch performance of the data structure is not 
5 lower than that of the assumed tree structure; and e) when the 
selected sub-tree structure satisfies the conditions (1) and 
(2), replacing the selected sub-tree structure with the 
equivalent table to constraint thft data structure. 

The condition (I) may be that, when the selected 3Ub 

](} tree structure is replaced with the equivalent table to form 
a new data structure, a maximum search time Tmax_t calculated 
from the new data structure does not exceed a maximum swaich 
time Tmax calculated from the assumed tree structure . The 
condition (2) may be that, when the selected sub- tree structure 

1^ i s rp.pl arftd with the equivalent table to form a new data 
structure, a nece33ary amount of memory for the now data 

structure is smaller than that for the assumed tree structure. 

A decision on whether the conditions (1) and (2) may be 
satisfied is made depending on whethex the following equation 

20 is satisfied: 

N D ~ N L X K, when K = Te/Tn, 
whe.rft N D 1 s the number of items of data included in the selected 
sub- tree structure, N L i3 the number o£ levels of the 
selected node or lower m the assumed tree structure, Tn is 

25 search time per node, and Te is search time per entry in the 
equi valen I table . 
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According to further another aspect of the present 
invention, an apparatus for constructing a data structure in 
which items of data are stored for search, includes: a tree 
formation section for forming an assumed tree ctruoturc in which 

5 all tne items of data are stored; a node selector for 

sequentially selecting a node from the assumed tree structure 
Lu selecL a sub- tree ^Lxucluiy desiynaled by Lhe stilfciuLed node, 
forming an equivalent table storing a portion of the i terns of 
data corresponding to the selected sub-tree structure in a table 

11) form, and determining the selected sub-tree structure when it 
satisfies the following conditions; 1) an amount of memory 
required to store a data structure including tne equivalent 
table in plac© of the selected sub-tree structure is smaller 
Lh<an LhdL required Lu sluxe Hie assuuifcid Liew sLluCtuie; and 2) 
KP.*rr.h performance of thfi data structure is not lower than that 
of the assumed tree structure; and a data structure formation 
section for replacing the selected sub-tree structure 
satisfying the conditions {1} and (2) with the equivalent table 
corresponding to the selected sub-tree structure to construct 

20 the data structure. 

According to still another aspect of the present invention , 
a search system includes: a memory storing a data structure in 
which items of data are stored for search, the data structure 
comprising: a tree structure m which the items of data are 

25 ctorod oxocpt for a portion of the itemc of data corresponding 
Lu a sub-Liee sliucturc;, which is d aelecLed yuiliou of au 
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assumed t.rftft structure formed by all th« 1 tarns of data; and an 
equivalent table storing the portion of the items of data m 
table form; and a search section for searching the data structure 
for an item of data matching input search data, 
5 The search system may further include: a tree formation 

section for forming an assumed treo structure in which all the 
items of data are stored; a node selector lor sequentially 
selecting a node from the assumed tre.fi structure to .qp.lpct a 
sub-tree structure designated by the selected node, forming an 

lu equivalent table storing a portion ot the Items of data 

corresponding to tho selected cub-tree structure in a table form, 
ana determining tne selected sub-tree structure when it 
satisfies the following conditions: 1) an amount of memory 
required lo store a data structure including the equivalent 

IS tahlp. in plarp. of the selected sub-tree structure is smaller 
than that required to store the assumed tree structure; and 2) 
search performance of the data structure is not lower than that 
of the assumed tree structure; and a data structure formation 
section for replacing the selected sub-tree structure 

20 satisfying the conditions (1) and (2) with the equivalent table 
corresponding to the selected sub-tree structure to construct 
thA data struof-nrA that 1s s tor Ad 1n the memory. 

According to another aspect of the present invention, a 
storage medium for used in a search system, storing 

25 computer-readable items of data to bo searched in a data 

slJiucluie, Lhe dala bliucluie cuiupxisiiiy ; a Lxee sUuclure in 
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which the items of data are stored except for a portion of the 
items of data corresponding to a sub- tree structure, which is 
a selected portion ot an assumed tree structure termed by all 
the items of data; and an equivalent tabic storing the portion 
5 of the Items of data in table form. 

According to another aspect of the present invention, a 
slur aye medium sloriiiy a computer-readable pxuyxaia £ul 
r.nnstn.icting a data structure in which items of data are stored 
for search, the program includes the steps of; a) forming an 
10 assumed tree structure in which all the items ot data are stored; 

b) sequentially selecting a node from the assumed tree structure 
to select a sub-tree structure designated by the selected node; 

c) forming an equivalent table storing a portion of the items 
uf daLa co-Lrespundiny Lo the selected sub-tree structure in a 

1.S tab! a farm; d) determining whether t.fift «!ftlftr:tftd suh-t.rftft 

structure satisfies the following conditions; 1} an amount 
of memory required to store a data structure including the 
equivalent table in place of the selected eub-tree structure 
is smaller than that required to store the assumed tree 

20 structure; and 2) search performance of the data structure is not 
lower than that of the assumed tree structure; and e) when the 
SftlAfitArt sub- t re a structure satisfies thA conditions (1) and 
(2), replacing the selected sub-tree structure with the 
equivalent table to construct the data structure. 

25 Ae described above, the equivalent table stores a 

plurality uf entries without the need uf poiaLexa to child nudeb 
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as in the case of a tree structure, resulting in reduced amount 
u£ memury xequiied Lu a Lure Lhe da Lei a true Lure. 

Further, a node to be replaced with an equivalent table 
is selected on condition that, if the node is replaced with the 
equivalent table, then the maximum search time for a new data 
structure including the equivalent tabic is not longer than the 
caicularea maximum search rime and the necessary amount 
of memory is reduced. In other words, when the number of items 
of data existing at the nude or lower in the assumed tree 
structure is sufficiently small, the node is replaced with the 
equivalent table. Accordingly, the present invention provides 
a search system allowing the necessary amount of memory to be 
reduced without decreasing in performance compared with the 
conventional radix search method. 

IS RRTF.F DF.SO.ft 7PTTON OF THF DRAWINGS 

Fig. 1 is a diagram snowing an example ot a set or data 

to be searched and corresponding related information ; 

Fig . 2 is a diagram showing a conventional binary tree 
structure in which the data to by searched are stored; 

20 Fig. 3 is a diagram showing a set of expanded data to be 

searched and corresponding related information, wnerem the 
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expanriftd data arft obtained by Axpanding thA data as shown m 
Pig. 1 to meet a quad tree; 

Fig* 4 is a diagram showing a conventional quad tree 
structure in which the expanded data to be searched are stored; 

5 Fig. 5 is a schematic block diagram showing a data search 

systftrn that imp! flmp.nts a data structure accorcH ng to fh« present 
invention ; 

Fig. 6 is a diagram showing an example of a data structure 
for search according to an first embodiment of the present 
10 invention; 

Fig. 7 is a flow chart showing a search operation using 
the data structure according to the first embodiment; 

Fig. 8 is a schematic block diagram showing a data search 
system having a data construction section according to the 
15 present invention: 

Fig. 9 is a diagram showing an example of a set of data 
to be searched and corresponding related information ; 



Fig. 10 is a flow chart showing a data- structure 
construction operation according to an second embodiment of the 
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present invention; 

Fig. 11 is a diagram snowing an example of a tree structure 
formed in a tree formation section according to the second 
embudiuieul u£ the piesenL iuveuliuu; aud 

5 Fig- 12 is a diagram showing another ftxample of a data 

structure for search constructed by the second embodiment of 
the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Referring to Pig. 5, a search system is composed of a 
lii computer 11 , an input device 21 such as a keyboard ana a pointing 
device, and an output device 31 cuoh as a monitor. The computer 
11 implements a search section 12 by running a program thereon. 
A memory 13 is a main memory for a processor to store a data 
structure according to the present invention « 
•[ S Referring to Kig. 6, a data structure a ccnrrl i ng to a first 

embodiment of the present invention is formed by an equivalent 
raDie ana a quad tree usca in the radix search, assuming 

the maximum bit length L = 3 bits. In this example, the data 
structure as shown in Fig. 6 is obtained by replacinq the node 
20 N2 and its child nodes N3 -N7 as shown in Fig . 4. with the equivalent 
table N2_t . In other words , a set ol data represented by the node 
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NP. that, is, all thA data inolndAd in a sAf of hit strings p.ar.h 
starting with "00", are 3tored in the equivalent table N2_t . 

As shown in Fig. 6 > each of the nodes Nl , N3 , N9-N14 s Lores 
information having a format composed of a node information flag 
5 field FG , a pointer field having four pointers each 

corresponding branches! "00", "01', "10", and "II" to its 
child nodes, and a related information field. The node 
information flag ?G is sat to 1 whfin storing data to ha sp.arnhftd 
and to 0 when storing no data to be searched. 

iu in the node Nl, tor example, the node miorrnation tlag 

FC is act to 0 because the node Nl does not store the related 
information correspondinq to data to be searched. The 
respective four pointers each corresponding to 11 00" , "01" f "10", 
and 11 11" indicate the equivalent table N2_t, null, null, and 

15 the. nodA Nfi . Sinr.e the. node Information flag FG is set to 0, 
the node Nl stores an invalid value " - n in the related 
information field thereof. 

In the node N8, for example, the node information flag 
FG is set to 0 because the node N8 does not store data lo be 

20 searched. The respective four pointers each corresponding to 
branches "00" , "01", "10", and n ll" indicate NULL , the node N9 , 
NULL, and thA node Nl 0 . Since the nodft Information flag FG is 
set to 0, the node NS stores an invalid value in the related 
intormation Held thereof. 

25 In the node Nil, for example, the noda information flag 

FG is seL Lo 1 because the uude Nil s lures dala lu be beaiuhed 
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(here, "110100" ) . All four pointers each corresponding to "OO", 
"01", "10", and "11" indicale NULL because Lhe node Nil Is a 
lMf mode- Since the node Nil stores the data "1101OQ", the 
related information "E" is stored (see Fig. 3). 
5 Trie equivalent taDie HZjt stores as entries the items of 

data to be searched corresponding to the node N2 and its 
child nodes N3-N7 as shewn in Fig. 4, Here, the respective 
entries are located as entries at consecutive addresses in 
Lhe memory 13. Each of Lhe entries stored in the equivalent 

10 table N2_t has a format composed of a 2 -bit table node 

information flag field FG , a tail entry flag field, a data bit 
String field, a search bit length field, and a related 
information field. 

A 2 -bit table node information flag FG is set to "2" for 

15 all th© entries of the table N2_t. A tail entry flag is set to 
"1" for lhe last enLiy and to "0" fox Lhe other tm tiles. A bLriiiy 
of data bits for panh pntry is fnrmpri by adding onp. or mnrp bit.? 
"O" to the corresponding item of data to be searched so that 
the olt length o± tne corresponding item of data is equal to 

20 the maximum bit length L (L = 8) of search data. The search bit 

length indicates the number of bits to be searched from tne 
topmost bit. The related information field stores related 
information corresponding to the item of data to be searched. 

Sf.ASCH OPERATION 

25 In the above- described data structure, when branching to 

a node having a node information flag ot *o" or "1", the search 
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i r r.arrip.d nut hy f ol 1 owing the trp.e of node.? while sequentially 
checking the search data in units of two bits as the case of 
Fig, 4, When branching to one having tne node information flag 
of "2", the control ie shifted to the equivalent table N2_t and 
5 the search operation is carried out by simply searching the 
equivalent table N2_t for an entry matching the search data. 
The details of search operation will be described hereafter, 
Pftferring to Fig. 7, vhp.n inputting search data through 
the input device 21, the search section 12 reads the information 

lu of a root node trom the memory 13 storing the aata structure 
as shown in Fig. 6 (step Al). Then, the ccarch section 12 
determines whether the node information flag FG of a node in 
question is w 0" or "1" (step A2) . When the node information flag 
FG is w l", the search section 12 stores the related information 

15 stored in the node and a length of matching bits into a memory 
(not shown) {step A3). Here, a length of matching bit3 is 
dependent on the level of a node in question. For example, m 
a node at a level NL , a length of matching bits is 2 x (NL - 

1) ■ 

2u After the step A3 or when the node information flag FG 

is u O", a pointer corresponding to the first 2 bits of the search 
data is selected from the tour pointers (step A4) and it is 
determined whether the value of the selected pointer is NULL 
(step ao 5. if the value of the selected pointer is NULL (YES 

25 at st&p AS ) , then the ssarch section 12 selects the 

longest mat china outs fx urn Lhe related iuf uxmaLiun stored and 
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outputs 1 1 : as a search result to the output device 31 (stftp 
and then the 3earch operation is ended. 

If the value of the selected pointer is not null (no at 
step A5) / then the search section 12 inputs the information of 
5 a branch indicated Ihe selected yoinLer (sley A6). hi Liils 
stage, it cannot be Jtnown if the branch indicated, by the selected 
pointer is a node or the first entry of the equivalent table. 
Therefore, the search section 12 accesses a selected number of 
addresses of the memory 13 starting with the addre33 indicated 

10 by the selected pointer using one which of the number of 

addresses required to store data for a node and the number of 
addresses required Lo s Lure data fur an entry of the equivalent 
table is greater than the other. In the case where the number 
of addresses required to store data for a nodfe is "2" and 

!:> the number of addresses required to store data tor an entry of 
the equivalent table is u i", the search section 12 accesses the 
two addresses of x and x+l, where the address x is indicated 
by the selected pointer. 

Alter the information of a branch indicated by the selected 

7.0 pointer is inputted (step Afi), it is determined whether a norlp- 
information flag IPG is " 2" (step A7} . When the node information 
flag Jr'G is not n 2" (NO at step A7), the control goes bacx to 
the step A2 and the search is carried out by following the tree 
as indicated by the steps A2-A6. 

25 When the node information flag FG is "2" (YES at step A7 ) , 

it means that the branch is the equivalent table N2_t . 
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Accordingly, the. sfiarnh sp.nfinn 12 dp.tp.nninp..<; whp.thp.r the data 
bit string of a selected entry in the equivalent table 

isz_t matches the search data with respect to the search bit 
length thereof (stop AS). 
5 When the data bit string of a selected entry in the 

equivalent table N2__t matches the search data with respect to 
the search bit length thereof (YES at step A3), the search 
section 1? storp.fi the* related information stored in thp. P.nfry 
in question and a length of matching bits (equal to the 
11) corresponding search bit length) into the memory (not shown) 
(step A9) . 

Arter tne step as or when tne data bit string of a selected 
entry in the equivalent table N2_t does not match the search 
data wlLh respect to the search bit length thereof (NO at step 

IS ) , it is dAt^rminAfi wh^th^r thft ^ntry in question ifi the. tail 

entry (step A10 } - When the entry in question is not the rail 
entry Cnu at step Alu), the subsequent entry is selected (step 
All) and then the steps A3-A10 are repeatedly performed until 
the tail entry. When the entry in question is the tail entry 

2U (YES at step A10), the search section 12 selects the 

longest matching one from the related information stored and 
outputs it as a search result to The output device 31 (step A12) 
and then the search operation is ended. 

The search operation will be described taking the data 

25 structure as shown in Fig, 6 as an example. 

Assuming that seaxch data "00011111" is input Lhxouyh the 
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input device 21, the search section 12 reads the information 
of the root node Nl from the memory 13 3toring the data structure 
as shown in riq. 6 and then determines whether the node 
information flag FG of the node Nl is "0" or n l" (stops Al and 
5 A2). Since the node information flag FG is "0" and lh« II-lsI 
2 hits of thp. searnh data is w 00", the pointer for "00" 
corresponding to the first 2 bits of the search data is selected 
(step A4] . Since the pointer for "00" indicates the equivalent 
tabic N2_t, the ccarch section 12 inputs the information of the 

10 first entry stored m the equivalent table N2 t (step A6K 

Sine© th© nods information flag FG of ths first: entry is 
u 2" (YES al step A7), the search section 12 deteiutiueb whether 
Lhe data bit string of the first entry in the equivalent table 
N2_t matches the search data with respect to the search bit 

16 length thereot (step A8). 

In thic example, the data bit string of the first entry 
is "00000000" and the search bit length thereof is "2". Since 
the first 2 bits of the search data "00011111" matches that of 
the data bit string "00000000" (YES at step A3), the search 

20 sftcf 1 on 1 7 storAfi the reJ atftd Informati on "A" stored i n rhp. f i r.qr 
entry and a length of matching bits m 2" into the memory (not 
shown) (step Ay). Thereafter, the second entry or the 
equivalent table N2_t is selected (step All) and then the steps 
A3-A10 are repeatedly performed while sequentially selecting 

25 the entries of the equivalent table N2_t until the entry in 
question reaches the tail entry. Here, in the cases of the third 
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and fourth entries, the step A8 determines that the first 2 bits 
of the scorch data "OOOlllll" matches that of the data bit string . 
Therefore, a pair of the related information "C" s for Ad in the 
third entry and a length of matching bit 3 "4" and a pair of the 
5 related information "D" stored in the fourth entry and a length 
of matching bits "5" are- stored in the memory (not shown) (step 
A9) . 

When the entry in question reaches the tail entry (YES 
at step A10), the search section 12 selects the related 

10 information "D M having the longest matching length and outputs 
it a3 a search result to the output device 31 (3tep A12). 

in this embodiment , a pair of the related information and 
the search bit length is stored in the memory (not shown) at 
Hit* slept? A3 and A9 . 

1 5 Alternatively, the equivalent table N2_t may be previ gits') y 

sorted in ascending order of search bit length. In this case, 
only the related information is stored in the memory (not shown) 
at the steps A3 and A9 and the related information stored latest 
is output as a search result to the output device 31. 

20 The major portion of search time is occupied by memory 

access time. Accordingly, it can be assumed that the time 
rAqnired to access nnp ah try of thA AqnivalAnt tab! a N?_r is 
substantially equal tc the time T required to follow one node 
or the tree, m the case where a search result is obtained from 

25 the equivalent table N2__t, the maximum eearch time ie 5T that 
ia ublalned by aQdiny Ihe Lime T required to follow the node 
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Ml anH thA firnp. 4T rsqn i r Ad to sfi^rnh all fintn'AS of thp 
equivalent table N2__t . In the case where the equivalent table 
N2 t is not accessed but one of the nodes N11-N14, the maximum 
search time ie 4T as described before . Therefore, the maximum 
5 search time of the embodiment is 5T. 

As described above, the equivalent table stores a 
plurality of entries without the need of pointers to child nodes 
as in the case of a tree structure, resulting m reduced amount 
of memory to store the data structure. For example, the 3ize 

10 of the data structure of the present embodiment as shown in Fig. 
6 ic smaller than that of the conventional data structure as 
shown in Fig. 4. Each node in the data structure according to 
the embodiment needs a total of 34 bits, which is greater than 
that o£ the conventional data structure as shown in Fig. 4 by 

15 1 ni t because a node information flag FG for each node in the 
embodiment is 2 bits while that of the conventional data 
structure is 1 bit . In the case where each entry in the 
equivalent table is composed of 2-bit node information flag FG, 
1-biL tail enLry flag, 3-bit data bil string (L=8), 4-bit search 

20 data length, and 8-bit related information, a total of bits for 
each entry in the equivalent table is 23. Accordingly, when 
6 nodes ana 4 entries are stored as shown m Jt'ig. b, a total 
of bite in the data structure are 365 ( =■ 34 x 8 + 23 x 4). 

in contrast, since each of 14 nodes needs 33 bits in the 

25 conventional data structure as shown in Fig, 4, a total of bits 
is 462 (- 33 x 14). Therefore, compared with the conventional 
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data sfnic;f^rA # thA prASAnt AmbndimAnt allows thA si7ft of data 
structure to reduce by about 22%. 

Further, the present embodiment can reduce the number of 
addressee to store the data structure. Assuming that the number 
5 of bits for each address is 32, the conventional data structure 
as shown in Fig_ 4 needs 23 addresses because two addresses 
are needed to store data for each of 14 nodes , 

Contrariiy, according to the present embodiment, 
the number of nodes is 8, the number of entries in the equivalent 

10 table is 4, tne necessary number ot addresses to store data for 
one node ic 2, and the necessary number of addresses to store 
data for one entry is l. Accordinqly, the necessary number of 
addresses to store the data structure as shown in Fig. 6 is 20 
( = 2x8 + 1x4). Therefore , compared with the conventional data 

\5 structure , the present, embodiment al 1 owa the number of addresses 
in the memory to reduce by about 20%. 

DATA STRUCTURE CONSTRUCTION 
Referring to Fig. 8, a search system is composed of a 
cuiupulei. 41 , an input device 51 such as a keyboard and a poinliuy 

70 device, an output device 61 such as a monitor, and a memory 71. 
which arc connected to the computer 41. The computer 41 
implements a data construction section 42 and a search section 
43 by running corresponding programs stored in the memory 71 
such as a nonvolatile semiconductor memory , a aisx, or the like. 

25 A memory 44 is a main memory for a processor to store data to 
be searched and corresponding related Information, which are 
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used to construct a data structure according to the present 
invention . 

The search section 43 has the same functions as the search 
section 12 as shown in Fig. 5. The data construction section 
:j 42 includes a controller 421, a tree lormation section 422 
i ncl nd1 ng a route poi nro.r r^gi star 423 f a rp.pl acad node* flAlAnfnr 
424, and an equivalent table formation section 425. The 
rj controller 4^i controls the operation 01 the components m the 

,rt data construction section 42. 

j{ 10 The tree formation section 422 has a function of formlnq 

O a tree structure (here, quad tree) used for radix search on 

-\I the memory 44 based on the data Lo be searched and the 

fl r.orresponding related information, which are stored in 

L s the memory 44. 

^ Li The replaced node selector 4^4 has a runction of 

H calculating the maximum search time in the tree structure formed 

by the Lree formation sec Lion 422 and further a function of 
selecting a node to be replaced with an equivalent table on 
condition that , if the node is replaced with the equivalent table , 
20 then the maximum search time for a new data structure including 
the equivalent table is not longer than the calculated maximum 
search time and the necessary amount of memory is reduced. 

The equivalent table formation section 42 5 has a function 
ul fo-Liuiuy da equivalent table from the tree structure of the 
?.S qpipr.te.d node or lower so that the equivalent table stores all 
items o£ data to be searched included in a set of data, indicated 
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by trie selected node to be replaced with the equivalent table 
and a function of replacing the tree structure of the 
selected node or lower with the equivalent table. 

Hereafter, the data structure construction operation will 
5 be described by referring to Figs. 9-12. 

ItefArring to Pig. 10, whAn 1*hA nnntrol 1 <=>r 421 i ns tmnts 
the tree formation section 422 to form a tree data structure 
for radxx search, the tree formation section forms a tree 
data structure as shown in Fig. II on the memory 44 based on 
10 the data to be searched and corresponding related information 
as shown in Fig. 9 (step Bl). 

In Fly. 11, Llie iuf uruialiun btuied in each of Llie nude* 
N1-Nn is similar to that- in each of the nodes NT -Nl d as shown 
in Fig* 4. 

i.s Tnerearter , tne tree tormation section Ml stores a 

pointer to the root node Nl in the root pointer register 423 
(step B2). When the tree formation operation of the tree 
formation section 422 has been completed, the controller 421 
starts the replaced node selector 424. 

20 The replaced node selector 424 calculates a maximum search 

time Tmax for a tree structure of the root node Nl indicated 
by the pointer stored m the root pointer register (step 
E3) , The maximum search time Tmax is calculated by Tn x NLmax, 
where Tn is the time required to follow one node of the Lxee 

25 and NLmax is a level of a lowermost node in the tree. 

Further, the replaced node selector 424 sequentially 
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selects the nodes of the tree structure as shown in Fig. 11 
according to a predetermined rule- For example, the nodes of 

Fiq. 11 are sequentially selected in ascending order of level 
and, whan two or more nodes are found at the same level, 
5 Ihese nodes axe sequentially selec Led in brauch order presented; 
"no" , "01". "10", and "11". 

The replaced node selector 424 determines whether a 
sequentially selected node satisfies the following conditions: 

a) when the sequentially selected node i3 replaced with an 
10 equivalent table to form a new data structure, the maximum 

search time Tmax_t calculated form the new data structure 
Is not loiigai Ihan the maximum search time Tmax calculated 
at the step B3; and 

b) when the sequentially selected node is replaced with an 
15 equivalent table to form a new data structure, the necessary 

amount o£ memory for the new data structure ic smaller than 
before the replacement, 

A decision on whether a sequentially selected node 
satisfies the condition (a) is made depending on, for example, 
7.0 rhp. f o.l lowing equation: 

N D ^ N Tf >< K, when K = Te/Tu, 

where N D is the numoer ot items ot data to De searcnea (wnicn 
le the same ac thar of entries in the equivalent table) included 
in a set of data deslqnated by the sequentially selected node, 
25 is the number of levels of the sequentially selected node 

or lower In the tree structure, Tn is the search time per node, 
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and Te is the search time per entry in the equivaient table. 
If N D =t N L X K for a selected node, then it is determined that 

the selected node satisfies the condition (a) . 

When having found a nods satisfying both the conditions 
5 (a) and (b), th^ replaced nude seleotui 424 output* a pointer 
r indicating the found node to the equivalent table formation 
section 425 (step B4) . 

When the replaced node selection operation ot tne 
replaced node selector 424 has "been completed as described above, 
U) the controller 421 starts the equivalent table formation section 

425. 

Upon receipt of the pointer r indicating the found node 
from the replaced node selector 424, the equivalent table 
formation section 425 releases a tree structure of the found node 

15 or lower. Thereafter, the equivalent table formation section 
425 searches the memory 44 for a collection of data to be searched 
and corresponding related information, which are included in 
a set of data designated by the found node, and forms an 
equivalent table from the found collection of data and 

20 corresponding rAlafftd Information (step RS). Than, tha 

equivalent table formation section 425 store3 the equivalent 
table formed at the step into the location indicated by the 
pointer r on the memory 44 (step B6). 

The controller 421 may repeatedly start the replaced node 

25 selector 424 and the equivalent table formation section 425 
arbitrary times. In the case where the replaced node selector 



'2001 01/25 THU 13:57 FAX 03 3288 3222 Ktsuragi Patent -* 0STR0LENK 



1029/059 



FQ^-Sl? 27 

424 is started by the controller 421 two or more timAs, a 
previously selected node i3 canceled before performing the step 
B4. 

Referring to Fig. 12, the tree structure of the node N6 
5 ox lower is xeplaced wiLli an equivalent lable N6_L . This data 
structure is constructed from the tree structure formed by the 
tree formation section 422 by operating the replaced node 
selector 424 and the equivalent tabie tormation section 425 
once , 

10 An example of the data structure construction operation 

will be described, taking the case of Tn - Te - T, that is, K 
- Te/Tn - 1 as an example, where Tn is the search lime pel node, 
and Te is the search time per entry in the equivalent table. 

The tree formation section 422 forms a tree data structure 
as shown in Fig. 11 on the memory 44 and stores a pointer to 
the root node Nl in the root pointer register 423 (ctepc Bl and 
B2). Thereafter, the replaced node selector 424 calculates 
a maximum search time Tmax for the tree structure as shown in 
Fig. 11 (step B3). Since the lowermost level of the tree 

2U structure as shown 1 n F1g. 11 1s "4", fhA maximum sAarnh HmA 
Tmax for the tree structure is 4T. 

subsequently, the replaced node selector 424 sequentially 
selectc the nodes starting with the root node Nl according to 
breadth-first search. If the node Nl is replaced with an 

25 equivalent tabie, then the number of entries in the equivalent 
table is 8. Therefore, the maximum search time Tmax_t required 
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to search a new data structure including the equivalent table 
ic ST. Since Tmax_t = 8T cxoccdc Tmax - 4T, this node NX docs not 
satisfy tne condition (a), This causes the replaced node 
selector 424 to select a subsequent node N2. 
J I£ Ihu node Nl lb replaced with au equivalent lablw, Llieii 

ThA nnmhp.r of ftntrip.?; in thA AqnivaTent tab! a i p; 4 _ Thp.rp.fnrfi, 
the maximum search time Tmax_t required to search a new data 
structure including tne equivalent table is st. which is 

obtained by adding the search time T for the node NI to the search 

10 time 4T for tne 4-entry equivalent table, since Tmax t = 5T 
oxcaods Tmax = A.T , this nodo N2 does not satisfy the condition 
(a). This causes Llie replaced nude seleclur 424 Lu select 
a next node N6 because the breadth-first search schAmp is 
employed, 

15 If tne node No is repiacea witn an equivalent tame, men 

the number o£ entries in the equivalent tabic is 2. Therefore, 
the maximum search time Tmax_t required to search a new data 
structure including the equivalent table is 3T, which is 
obtained by adding the search time T for the node Ni to the search 

211 time ?T for the 7 -entry equivalent table. Since Tmax_t = 3T is 
smaller than Tmax = 4T, this node N6 satisfies the condition 
(a) . 

Furthermore, in the case where each node needs 34 bits 
and each entry of an equivalent table needs 23 bits, the release 
25 of the nodes N6 and its child nodes N7-N9 reduces the number 
uf bits by 136 {- 3 4 x 4) and the number of bits required for 
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the equivalent tadle N6_t is 46 (= 23 x 2) . Therefore, tormation 
of the equivalent table N6_t achieves a reduction of 0 0 bits 

(= 135 - 46), which means that the node N6 also satisfies the 
condition (b) . The replaced node selector 424 selects the nod© 

J N6 mid Outputs d pointer Lo the node N6 to the equivalcuL table 
format -ion s p.nr inn 

Upon receipt of the pointer to the node N6 from the 
replaced noae selector 4^4. tne equivalent taaie tormation 
section 425 releasee; a tree structure of the node No and its 

10 child nodes N7-N9. As shown in Fig. 11, a set of data desiqnated 
by the node N6 consists of all the bit strings starting with 
"10" . The equivalent, table luJ-iuatiun section 425 t>earulies 
the memory 4 4 for a collection of data to be searched and 
corresponding related information, which are included in the 

15 set oi data designated by tne node N&. 

In this example, the memory 44 stores the data to be 
searched and corresponding related information as shown in Fig. 
9. Therefore, the equivalent table formation section 425 finds 
two pairs of data, (1000, E) and (10000, F) , as a. collection 

20 of data to be searched and corr&sponch ng related Information, 
which arc included in the 3ct of data deaignated by the node 
No. The equivalent taule formation section 425 forms data of 
entries for the equivalent table N6_t from the found pairs of 
dcila (1000, E) and (10000, F) , respectively. Each of the entries 

25 stored in the equivalent table N6_t has a format composed of 
a 2-bit table node information flag field FG, a tail entry flag 
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field, a data bit string field, a search bit length field, and 
a related information field. Each data bit string is obtained 
by addinq "o" to a correspondlnq item of data so mat the tit 
length thereof is equal to the maximum bit length of search data. 
5 The equivalent table N6_t formed as described above is stored 
at thft location 1nd1 eated by the pointer to the node N6 on 
the memory 44 . 

As described above, formation o± an equivalent table 
according to the present embodiment can achieves a reduction 
10 of necessary amount of memory without increasing In maximum 
search time. In addition, the present embodiment can reduce 
the number of addresses to store the data structure. Assuming 
tbat the number of bits Pnr eanb address is 32. the* nnnvp.n1* i nna 1 
data structure as shown in Fig. 4 needs 28 addresses because 
is two addresses are needed ro store data tor each of 14 nodes. 

Contrarily, according to the present embodiment, as show 
In Fig. 12, the number of nodes Is 9. the number of entries ia 
the equivalent table N6_t is 2, tha necessary number of addresses 
to store data for one node is 2, and the necessary number of 
2U addresses to store data for one entry is 1. Accordingly, 

the necessary number of addresses to store the data structure 
as shown in Fig. 12 is 20 (= 2 x 9 + l x 2), Therefore, compared 
with the conventional data structure, the present embodiment 
allows Lhe uumbex of addiesbes iu the uiewyjty to leduue by about 

75 



